/ ^yrefc^Tableldata 
; Entry of these amendments is respectfully requested. . A A 

Applied respectfully request that, upon allowance of product claims, method chums 4-12 and 

iSshouldbere^ 

■■GuidancconTrea^^ 
* " USC § 103(b)" sets forth the rules, upon allowance of product claims, for rejoinder of process claims 

. covering the same scope of products. 

Claims Rejections - 35 USC §§ 101 and 112 

1 The Exammer has rejected 
invention is not su^^ 

specific and are generally applicable to any polynucleot^ The Exanuner sutes that the specificatxon 
discloses many potential uses lor the polynucleotide including gene expression,. as a probe, and "for 
expressing noRNA and protein, or conversely to block transcription or translation of the mRNA , ^ 
Additionally, the Examiner states that these are "non-specific uses that are applicable to po.ynucleottdes tn 
genera, andnotpanicul^^ Father, the Examiner states that 

■■tteclaim^^ 

creatine kinase M, and has no asserted or identified specific and substantial utihues". Furthermore, the 
Examiner states that, the claimed invention is ^neric m nature and applicable to a m^ 

.for the compounds. This rejection is traversed. • " 

. ' Applicantshereby submit DECLARATION UNDER 37 CFR 1 .132 OF MIGHAEL G. WALKER. A* 

, shown in me Declanrfon and now irtl jm- in the art, the clahned cambin^n^diagnosHof _ 
disorders of cardiac rouble function. In the Dec^ 

Association, or GBA method, out of a total of 45,235 assembled sequences, only 4S novel . | ; : 

Received 



lines %8. Thus, the Declaration presents evidence from an inventor who is also a person sktiled m the art 
~- to^portApp^ 

cardiac ^'lia^M^^^^ ^ ™? 
with known diagnostic and prognostic markers. 

In farther support, Dr. Walker presented a recent article by Thompson et aL (2002, •'Identification 

^dConfinn^^ 

theGBA method for predicting coexpression of genes (see p. 1518 of Thompson etaL). In particular, as 
presented in the DECLARATION, the Thompson article establishes that a set of genes coexpressed in 
breast and ovarian cancer as determined by GBA analysis were indeed coexpressed as determined by 
> rc al-time RT-PCR andysis of ml^A extracted from ceU Unes derived from breast tissue (s^ 1520 

andKgure2ofTho m p S onet a l.). It is further noted mat the Thompson study used me memod of analysts 
of me instant application applied to datafrom the GenBankEST database, data which is also included in 
the LIFESEQ database. Therefore, based on the arguments presented above, the teachings in the. 
specification, the Thompson article and the DECLARATION UNDER 37 CFR U32 OF MICHAEL G. WALKER, 
AppUcantssubmkmatoneskil^ 

characterization mat the claimed combination is diagnostic of disorders.of cardiac mnscle function and 

has specific and substantial utility^ '\ 

Applicants respectfully point to the information supplied in the DECLARATION UNDER 37 CFR 1.132 
OF MICHAEL G. WALKER which clearly demonstrates that SEQ ID NO:41 was selected as a 
representative for the combination and as a surrogate marker for the very highly associated creatme 
kinase M, a known diagnostic marker for cardiac injury. The.utility for me combination, SEQ ID NO:4l, 
or an antibody that specifically binds the gene product of SEQ ID NOi41 is diagnosisof disorders of 
~: cardiac muscle function as defmedinme specification on page A Ime^ 

DECLARATION also states that the prognosis of cardiac patients is greatly improved with early diagnosis 
and treatment. Therefore, me present s 
surrogate markers in that diagnostic pr^ess ^ 
. function of the gene or the encoded protein. " . ';,,-,-,.;;/: 1p 



, den,o M .n» i onof^.yia-no.s^ 

F 2d'853~ 85(5-57. 206 USPQ 881, 883-84): -For**"" 016 ' and;jn contrast to criminal cases, the applicant 
• • does not b».e to pro** e».*nee st.ff«*nt to establish that an assemd »Ulity is true beyond a :., 

section VTJ; CCPA 1980). " ' ."' ^ j 

Applicants respectfully submit by virtue of medisclosu^^ 
Declaration of Dr. Walker who is clearly a person skilled in rhe art, that che claimed invention is more 
than^ofp-vidinganide^ 

disorders of cardiac muscle function. < . . 

With the DECLARATION UNDER 37 CFR 1.132 OF MICHAEL G. WALKER* the Thompson article, the 
arguments, and explanations above, Applicants respectfully request that the rejection of claims 1-3, 13 and 
14 under 35 USC § 101 be withdrawn. 

2. The Examiner has rejected claim, 1-3 and 13-14 under 35 USC § 1 12, first paragraph, based 
on the rejection of these claims for lack of utility under 35 USC § 101. 

Applicants respectfully .ubmit that they have provided sufficient evidence to clearly demonstrate 
that the 35 USC § 101 utility requirement has been satisfied. Therefore, Applicants respectfully request 
, withdrawal of the rejection of claims 1-3 and 13-14 under 35 USC § 112, first paragraph. 



yg:.^ " Early : 

notice to that effect is earnestly solicited. . y 

If the Examiner contemplate, other action or if a telephone conference would expedite allowance r 
of the claims, Applicants invite the Examiner to contact Applicants' Agent at (650) 855-0555. - 

AppUcants believe that no fee is due with this communication. However, if the USFTO 
determines that a fee is due, the ^ Commissioneris hereby authorised* charge Deposit Account No. 09- 
0108. 



Respectfully submitted, 
INCYTE GENOMICS, INC. 
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Yu-Mei Eureka Wang 
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3160 Porter Drive 
Palo Alto, California 94304 
Phone: (650) 855-0555 
Fax: (650) 849-8886 
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IN THE CLAIMS 

Claim 2 has been amended as follows: 

2. CTwicc Amended) An isolated polynucleotide comprising a nucleic acid sequence of SEQ ID NO:41 
or [and] the complement of SEQ ID NO:41. 
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nKCT.ARATIO" TTMflF.H 37 f TO 1 132 OF M"TT AKT, G. WALKER 

1, Michael G. Walker, declare: * 

1. I received my doctoral degree from Stanford University, Stanford, CA in 1992. I have held the 
position of Consulting Professor in the Depaitment of Medicine at Stanford University since 1995 and 
nave consulted to Incyte Genomics since 1996. My work has involved the development of analytical took 
for the characterization and annotation of molecules by their expression in relation to cellular function, 
disease, and metabolic pathways. I am the first named inventor on the pending application. 

2. The applfcation ^ 

function, and specifically for cardiac injury, which were identified by their co-expression with known 
diagnostic and prognostic markers, 

3 I understand that the OFFICE ACTION presents 35 USC §§101 and 112 rejections of claims: 1-3, 13, 
and 14 directed to the described invention as "not supported by a specific utility because the disclosed 
uses of the polynucleotide are not specific and generally applicable to any polynucleotide". 

4 The purpose of my declaration is to support the asserted utility of the-identified polynucleotides as 
diagn6 S ti CS ? To that end, I will discuss the attached journal article of Thompson fit it (2002; Identificauon 
and affirmation of a Module of ^expressed Genes, Genomics Research 12^17-1522) andthe data - 
presented in the application. ^ , 
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data (all genes similarly expressed across all tissues). v V 

b) The data set was tested and found to be reliable using ubiquitously expressed genes and known ■ ■ 
coexpressed genes. J ^ " v "" '\ ■ v " 

c) A novel functional module was found to be involved in breast cancers. 

d) Quantitative expression profiles of six of the genes identified in the novel functional module were 
confirmed using reverse transcriptase real-time PCR with four different cell lines derived from mammary 
epithelial tissue. 

e) Published information on these six genes revealed interactions in two distinct classes: 
transcriptional control and ubiquitin proteasome pathway. Thompson et aL suggest that three of the genes 
in the first class regulate transcription of the three genes in the second class. 

f) Thompson et ah stated that their work focused on a small subset of the available data and is 
expected to lead to the identification of many more functional modules. 

X I will now point out the parts of the specification that support the asserted utility, that the claimed 
combination is diagnostic of disorders of cardiac muscle function. >> * 

a) The known genes used in GB A were characterized as to function and disease indication in 
EXAMPLE VV 

b) As recited in EXAMPLE VI, "GBA identified 48 novel polynucleotides from a total of 45,233 ; 
assembled sequences that showed strong expression and association with the known cardiac muscle- ^ 
associated genes". Contrary to the Examiner's statement, the asserted utility is not applicable to all 
polynucleotides since 45,185 sequences were not significantly expressed in or associated with disorders of 
cardiac muscle function. 

c) The known arid unknown genes, the p-value for their highest association, and their disease 
indication were summarized in a table at the end of EXAMPLE VI. It is clear from the table that cardiac 
injury is the broadest diagnosis using the claimed combination of polynucleotides and that specific 
problems such as atrial fibrillation and myocardial infarction are further identifiable using the expression of 
their associated known and unknown markers.- » > 

d) SEQ ID NO:4l was selected as a representative for the combination and as a surrogate marker 

for the very highly associated creatine kinase M, a known diagnostic marker for cardiac injury, ; v .,;: , 
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8 Finally; it is now well known that the prognosis of cardiac patients is greatly improved with early 

dia^osis and rreatrnent The present sp^ 

surrogate markers in that diagnostic process without even knowing a Eriori the name, stnxcture, or / 

function of the gene or the encoded protein: 

I hereby declare that all statements made herein are true and that they are based on my own 

knowledge, information and belief. These statements are made with the knowledge that willful false 
statements are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code and that such willful false statements may jeopardize the validity of this application or any 
patent issued from it 

Date: P ^ ^ 

Incyte Genomics 
3160 Porter Drive 
Palo Alto CA 94304 




lichael Q. Walker, PhD 
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of Coexpressia Genes 



H. Garrett R. Thompson, 3 Joseph W. Harris,' Barbara J. Wold/ Stephen R. Quake/ 

and James P. Brody 3 ' 4 " ~ ; ; - _ _ 

Wesynchestoe a *P^^ 

analyze this dau «t *nd Identify »«*^ g?ta .iff In — from lung 

(K5MD7, SQSTMI, FL»OIII)L.Vfo confirm chese obsemtions of oppression by real-time RT Ft* anaiys* 
mftNA extracted from four model breast epithelial cell lines. 



Molecular studies of cellule functions have led to broad 
knowledge of cellular processes. Most cellule processes are 
the result of molecules ^ 

activity of individual molecules. The Study of W 
keoup* of genes has been termed modular cell otology 
l^artwell et al. 1999). We axe interested In Identifying func- 
tional modules. , . 

Technical comtraints make the study of individual genes 
much easier than Identifying interacting molecules. One ap- 
proach to identify modules is to examine genome scale ex- 

^^Geiies cocxpressed In manv different tissues, under both 
normal and diseased conditions and at different times during 
development are candidates for forming functional modules. 

There are different types of experimental gene expression 
data sets available for u$e to identify modules. DNA rmcraar- 
ray experiments measure mfcNA expression levels onaknown 
s«of ffenesbyway of hybridization. A fluorescent tag is put 
on an unknown single-stranded DNA molecule. A*t of 
slnrie-stranded DNAs of known sequence la Immobilized 
onto a surface at known locations. The unknown fluores- - 
- cently tagged sample is allowed to hybridize to its compli- 
mentary immobilized strand. The surface is scanned to create 
a fluorescent image. The intensity of the fluorescence i* a 
measure of the concentration of DNA in the sample. „ 

Substantial DNA microarray data sets exist, but are not 
easy to compare between different laboratories- These mea- 
surements are made relative to a control and the numbers are 
reported as a fold difference relative to the control. The spe- 
cific choice for a control varies widely between different labo- 
ratories. Most troubling is the lack of any reported error m 
these measurements. Hence, there is little systematic informa- 
v tion available on how certain the experimenter 1$ Of trie re- 
ported value. 1 . r 

Similar expression data measurements can be made by. 
DNA sequencing. Message RNA molecules isolated from a tis- 
sue are reverse transcribed into cDNA and cloned into Eh*- 
richia coll vectors to generate a library. Random clones are, 

: ^ ^£<urr^tKmd(n^ " : « ul hm: . ", . : - ' . ,; J :; ••' '£ ■ ■> ; ; .■, ■ " 



sampled from the library and a few hundred base pairs are 
sequenced from each. These are known as expressed sequence 
tags (ESTs). The sequence read from each done Is generally 
sufficient tp identify the gene when cross referenced to a con- 
sensus sequence data set . _ % L . 

The UnlGene data set (Schuler et al. 199*) groups the 
large number of publicly available DNA sequence fragments 
on the basis of sequence overlap* into unique genes. In addi- 
tion to complete sequences of some welkknowri genes, there 
are thousands of uncharactertted EST fragments that are 
found as a result of the clustering process. These unchaxacwr- 
lzed EST fragments are thought to represent previously un- 

characterUed genes. * 

Recendy> concerted efforts have been formed to con- 
struct a diverse set of cDNA libraries derived from various 
tissues in normal and diseased states. These libraries are 
heavily influenced by the National Cancer Institute s Cancer 
Genome Anatomy Proiect <CGAP>, ^^^^ 
characterize normal, precancerous, and maUgnantceUs. These 
libraries, when combined with the UniGene coUecnon pto- 
vide a set of gene egression data that can be analyzed to 
identify groups of coexpressed genes, - " 

In general, EST sequencing is more accurate but substan- 
tially more expensive than DNA rotaoarxays. DNA nricroar- 
ravs suffer from cross-hybridization, uncertain linearity, and 
numerous other technical problems. However, they are al- 
" most always more sensitive and cheaper to use. (The sensitiv- 
ity of EST sequencing is limited by the depth of sampling fcito 
the library. The costs increase linearly with the depth of se- 
quencing.) However, EST sequencing provides data that is eas- 
ily comparable across different experiments. EST sequencing 
"provides absolute numbers .that represent a sampling of the 
mRNAs present in a tissue sample. — 

The specific purpose of this analysis is to 
tional modules frorn^gene expression data ty*^***S 
croups of genei that are «5expressed. Our hypothesis is that 
coexpression has factional significance. Our approach of 
anting gene expression,across all tissues (rather £an £st 
comparing pairs of diseased and normal tissue), offers two 

5^ ! acro»itt tbgemer^eideriu^ 
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Second, modi** of genes that have a *P^ c h ^^ 
function In, for Instance, the development of the fetal brain 
but also play a pathologic role in. for sample, some forms of 
cancer due to mutation* that lead to misregulation may be 
identified. 

Although dhBST (Benson et al. 2002) contains sequence 
data from many different laboratories, the vast nU1<>W«: 
sequence analyzed In this study is derived either ™m the 
^^hington University/Merck EST project (HUller et al. 1996) 
cr the National Cancer Institute's Cancer Genome Anatomy 
PiOje^(CGAP)^ of ^ as is provided in dbEST, is 

insufficient for large-scale gene expression analysis, A Cluster- 
mg(orassembly)oftheE5nsm^ 

At least two databases have been established to do this The 
fiist, Uniqene (Wheeler et al, Z0Q2), is regularly updated and 
available freely onllfie. The second, TIGT* Human Gene Index 
(Quakenbush et al. 2000), i* available with a restrictive li- 
cense The data presented in this Study is based on the analy- 
st of gene oqpMWi data compiled in the UniGene database. 



: - ft* identic diltenttally expressed .gwi«*.te. f ■ 

sues, or groups of tissues). This is most simply 

PUbert Exact test, but can also use more 

methods (Avdic and Claverie 1997; Steketet al. 2000)^ The 

second class of algorithms seeks to identify groups of genes 

with similar patterns of expression across many (hundreds) 

different experiments: We need an algorithm from this sec- 

^^Mmydiff^app™^ to 
and organize gene expression data. Most algorithms wvaria- 
rC ; on Eisen% approach (Eisen et al. ^J^^^t 
data on the basis of Similarity of gene e ^ ie ^^t^S* 
approach, guUt-by-associaaon (GBA) was described by Walter 
et ^1999) and used to identify novel genes with expression 
patterns similar to those of known disease-related genes in a 
private (Incyte's LlfeSeq) gene expression data set generated 
from ESTS- In this work, we used th* GBA algorithm to iden- 
tify genes with similar expression patterns. ^ 
In this study, we first constructed a large set of human 
expression data from dbEST and UrtiGene. The data set was 
found to be reliable on the baste of its identification of ubiq- 
uitously expressed genes and known coexpressed genes. 
Then, we analysed this data set to identify modules ^ex- 
pressed genes. The analysis identified potential functional 
modules! One was Identuledin the Utcra^as a n^., 
expressed during pregnancy Cmompson et al. 1990), further 
confirming the reliability Of the data set. We also identi- 
fied a set of coexpressed genes that may form a novel func- 
tional module. The members of this functional module are. 
not well studied in the literature. We examined gene expres- 
sion in several cell lines to confirm that these genes are coex- 
pressed* 

RESULTS 

After constructing the gene expression data set, 
amined ubiquitously expressed genes as a check on the valid-- 
^V^, set^ Genes that showed the widest express^ 
(found in the greatest number of libraries) are shown i in Table 
1. No genes appear in more than one-thitd of the libraries. 
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The results of oui analysis tot coexpressed genes Tevealcd 
both known relationships and unknown relationships. We 
hiffhlleht one of the known relationships 3* anecdotal eyi- 

10 aene* with expression patten^ most similar to pregnancy 

are a group of pioteim that are found in large 
plawnwr^ompsonetal. 1990). Theyare all l<^ted on chro- 
mosome 19 and share common transcriptional regulatory el- 
ements (Thompson et al. 1990). . . , 

A postulated functional module is shown in Table 3. 
These five eenes are all shown with their compression relative 
to recently described transcription factor (Oettgen et 

al. 2000). To better understand the role of this module we 
oxamined the tissues in which it was expressed. We found the 
module to be expressed in some breast (Table 4) and ovary 
- (Table 5) tissues. The module Is also expressed in several other 
brain, lung cancer, and bone marrow tissues (Table 6). 

Many other significant relationships were found be- 
tween pairs of genes. Significance of a relationship was de- 
fined, following Walker et al. (1999). by a P-value, indicatmg 
the probability that the compression was dueto chance. Over 

of these P-values is shown in Bgu*e 1 along with the approxi- 
mate range of P-values for the functional modules described 
in this work. There is not necessarily a dear relationship of 
what constitutes a module. The structure of Hie data is more 
like a complex web then a group of distinct modules and our 
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definition of which genes belong to the PDEF module is fi*oV> 
ably not corr^lete^ ^^^^ of COC xprtSSion, we per- 
formed quantitative reverse transcriptase real-time^ PGR on 
four different cell lines derived from mammary epitlieliM trt- 
sue. We measured the quantity of mBNA ftomPDEF and the 
five other genes Shown in Table 3, along with a control, 
GAPD. Results are shown in Figure 2. 

DISCUSSION 

Examination of the most ubiquitously expressed genes (Table 
1) provides mteresting insight into the typical population of 
ceUular mBNAs. As expected, most of these mRNAs code for 
well-known ubiquitously expressed proteins (components of 
the ribosome, structural ptottins; and enzymes). However, 
one protein labeled tumor protein, bartslationally controlled 
1 or TFT1, is prominent due to its obscurity in the literature. 
Although Its name implies association with cancer, itis better 
described as a histamine releasing factor (WD an ?^^ 
studied as playing 4 role in the allergic response of mast cells 
that results in the exocytosis of histamine. However, its ubiq- 
uitous nature perhaps indicates it plays a much larger role m 
cellular Signaling. In fact, the recent^ JW 1 ^?^^ 
structure of this protein (Tbawet ; al,2001) revealed si^iflcant 
similarity to Mss4, a small GTPase *?»W^J^^ 
in amplification of extracellular signals. Our finding that 
TO is ubiquitously expressed is Supported by other works. 

'"jJukk \ Tc« jptf^ ! Mi^li..fcr USnH^BL ■ 4 : 
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One study used SAGE to exhaustively enumerate the papula- 
tions of mRNAs within colored cancer cell lines (Velculescu 
et aL 1999) and found TFT1 to be among the «ott abundant 
mRNAs In those cell line*- A second study found a yeast ho- 
moloj! to TPT1 to be one of the top 20 most abundant pro- 

•tQO"7\ 

The iceliablUtjr of the d?ta set was further confirmed by 
ex.min^^es coexpressed with PSG1. The human preg- 
nancy^Kycoprotetns(^G5) ^ 
that are dosely clustered on chromosome 19- This gr oup of 
TL, along ^th a ^ Othe« (CSHJ, C5H2) weie found in 
Lari* d^ved from placenta ntf fttal. ^?r s ^S£ 
one library derived from aorta (UnlGene Library Il> «2) >*» 
contained many of these genes. All of these genes are known 
to be specifically expressed In placental tissue*. . 

wValso identified member* of a novel funcnonal mod- 
ule. Six membert of this postulated *w£^7^J£ " 
repotted as follow*: PDEF, SQ5TM1, FIJ10111, H2AFO, 
]pSMP7 f and NUCKS. , ■' « 

PDEF wM first Identified as a prostate^ptthelium-specinc 
Ets transcription factor that interacts with the androgen re- 
ceptor to activate the promoter of the welMoiown prostate 
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identified as a positive regulator of maspln, a protease nnioi" 
'toi'thatis down-rtguUtedSn advanced breast cancers J* 
at 1994). Uter tavestlgatott identified PDEF as a candidate 

that PDEF mBNA'.was highly overexptessed In 14 of .20 pn- 
mary human breast tumor, exatnined. ^.*^?««*. «g* 
orient with metastatic bt^tcancer had 
19U<M higher in Wood as compart with noimal toilvidu. 
als (Ghader^nl and Sood 2001). Ah#R» ■*» 
teen Identified as being highly up-regulated in breast ancet 
«U Bnes (Fig- 1) and breast WHnS «****£^f&~' 
2001), only tWo wgets [the PSA promoter (Oettgen^t aL 
2000) and the maspin promoter (Yamada et aL 3O00)l ha»e 
teen identified that PDEF regulates. Little is known about 
PDEF's funcnon. elmer in breast cancer or in normal tissue. 

The class of Ets transcrtpnbn factors is characttrized by 
fte ETS domain, which binds nwhomericaUy to an 8-bp long 
DNA element (Ditt»« and Nordheim 1998). ^"J."* 
sctiptlon factors bind to the GGAA core sequence of UNA, but 
PDEF ptefers a GGAT core (Oet^en et al. 2000). 

P62 is a widely expressed cytoplasmic protein tnat « 
known to play a role in celhuar signaling by Interacting wtth 
Se SH2 domain of p56(l*) (Park et «L 199S) P62 was atoo , 
identified a* a ubiqulun-blndtag protein through tytaAtim- 

been found to fo,m cytoplasmic structures cal ed mtrahyaUne 
bodies that serve as a storage : place to muluubUjujnnated 
piot«r» (Shin 1998). "The ability of p62 to bind noncovalenfly 
to ublquitin and several signaling proteins, suggests that n«2 
msy play a regulatory role connected to the ubiquttin- 

P^"^™^^:^' the jQjTM^gene lias beei^charactei- 

ized previously (Vadlamudi and Shin 19«. ™ 
along with the DNA-binding studies of Oettgen et. al (2000), 
Sed us to Identify mo ;P oten«al >DNU«« 
within the SQSTM1 promoter and to postutete ftat PDFS 
regulates the SQSTlrfl prorwottir. ^e have peifpnneclwmsfec^ 
^ experiments showing tnat PDEF activates OieSQSTMpro- 
Cei cSowpson et al. ^2) threefold Wrbasal leveh 

The hypothetical protein FLJ10111 was Hist i<tenafied 
andnamedby theNedo FulHength cDNA Sequencing Pro,ect 
in japan. Although the message for the gene has been found 
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Involved in ^ _ . 

FL/lOiH is locate w a tightly packed region of chro- 
mosome I4thatcontain 5 otheiprtteasoine-Mlateq ?^<™- 
wata et aL. 2001). This 35-kb region contains Six different 
senes, two of which tPA23« and PA28fi) code for cxrniponents 
of the proteasome activator (PA2S), and txansripttm of genes 
in this region Is Induced by ^-interferon _ suggesting a mode 
of regulation. The PA28 proteasome a ctiva«W is taught «> 
bind to the 20 S proteasome and enhance the generation oi 
major histocompatibility complex <MHQ class l-bindm* pep- 
SiL et iL 1992; Yawata et al 2001). The products of the 
other foui genes in this 35-kb region ate not well character- 
Sed The predicted protein encoded by M1011X bu two 
KING finger domains, which are characteristic of E3 ublquitin 
ligases (Yawatt et *L 2001). The RING fingermotif is ***** 
many <>200) proteins in different eukaryotes, but not in any 
prokaTyotepKJteim(Fr«mom2000), ' 

Histories aie nuclear proteins responsible for organising 
genomic DNA into the tightly packed chromosome* within 
lykaryotic cells/ The histones play a role in eukaryotic tran- 
scriptional regulation and pbst-rxanslationaV modiacadons 
can change their DNA-bindlng properties. The transcnpt for 
H2AFO, a member of the H2A family, was found among this 
croup of genes, t _ mn , ^ .' 

The protein encoded by the PSMD7 gene, 512, is a regu- 
latory subunit of me proteasome. This protein i$ homologous 
to the mouse Mov34 protein. Mutations in Mov34 are lethal 
in the embryonic stage of development The protein is evo- 
lutionary conserved and homologs can be found in yeast, 
Drosophlla, along with the mouse. . 

NUCKS is similar (by sequence analysis) to the rat 
nuclear ubiquitous casein kinase 2, which was first isolated 
from both HeLa cells and rat brain and subsequently found to 
be ubiquitously expressed (Ostvold et aL 2001). The protein 
loaves to the nucleus andbinds single- and double-stranded 
DNA, It is phosphorylated at multiple sites by several kinases, 
mdudingprotein kinase C and casein kinase 2 (Oswold et al. 
1985, 1992; Walaas et aL 1989). - 

This approach to identifying functionally related mod- 
ules of genes has wide applicability. The libraries that com- 
mit the data set we examined were largely populated by 
Soi sequenced under the support of the CGAP pro- 

Eram which is focused on cancer. Hence, it is not surpnsing 
mat we identified a cancer-related module. We expect that 
concerted expression profiling of other diseased tissues w*U 
uncover functional modules active in those diseases. This 
method, however, requires a discovery-based (approach to 
studying molecular interactions. We queried the data set to 
examine some well-known genes and there is no information 
available on these. This may be ^xsmjb^m^ 
rarely (rather than abundantly) expressed mRNAs and tht 
paucity of sequences that were read in each library. 

The hypothetical Functional module that we identified 
in this study has members in two distinct classes, transcrip- 
- tional control, (H2AFO, VDEt, and NUCF^) and ubiquiun- 
proteasome pathway {FSM^ SQSJM1, and FlJlOlll). The 
obvious hypothesis that this leads to is that members of the 
fust class control the transcription of the second class and 
that this occurs in some diseased tissues. We have shown that 
: PDEF acU^ 
■ ^ ?Q$nn, p62, i5^oveie%ess#^ 

-^;:^%e.*o 'not^^l ■breast teue^fmcmpsort et-^.v-^2). -.X, ^- 



i^-cnle gene V^nstt* ^data: We coii^med ^M***- , 

genes are cc-expressed ^:^^^: iMMa ^^^ 
niRNA in cell lines derived from breast tissue. Published in- , 
formation on the function of the products of these pnes 
leads to a hypothesis about how these are functiortally re- 
lated. This work focuses on a small subset of the available 
data. The approach has wide applicability and could lead to 
the identification of many more functional modules. 

METHODS 
Computation 

We scanned the Homo Sapiens UniGene data set<build#146 

and compiled a Ust of libraries to which each UniGene zp- 
pears. Some appear multiple times to 
Screened out afi hbraries that had less than lM UmGmw 
associated with it. The final data set that we analyzed con- 
sisted of expression information for 96,574 genes across 1573 
libraries. ^ ^ expression data by using ^guUt-by- 
association (GSA), a coitt^natoric measure of ,^ a "%^ 
tween me expression, patterns of , two genes C^ker et a 
1999). We calculated the pairwise expression similarity be- 
tween aU 96,574 genes (4-7 billion comparisons). 

Cell Culture : - V'- \ ;" 

Four mammary epithelial cell lines were obtained from ATCC 
and exar^i^f« mRNA levels. MCF-7 cells were derived 
oriKintllyfrom an Sdehocardnoma. These cells are esttogen- 
rece^ pOMtive, HCCWZS^lis we* W f^Juiv^ 
metastatic adenocarcinoma. ™^ nt ^ d ^"LTnla 
of breast cancer- The cells are HerZ/neu negaave^and P sa 
necartve! MCF 10-2F and MCF-10-2A cells are both nontu- 
mffiic cell lines (they do 

oiesscd mice) that were denved from a 36-year-olo: remate 
w^th^brocystic breast disease. The MCF-10-2F celh are de- 
M*l f£mRoating «lls, whereas the MCF-IO-ZA ceils are de- 
rived from adherent Cells. , ^ A . . 

Mammary Withellrt cells v^ollfl^ In monotaymm 
iO^m 1 cultute dishes in the recommended medium supple- 
mented with chelated or unchelated hotse ra.feal b<j»me 
SasVppUcabte, until <(Mm influent. Ten dbhes^ «J- 
tu«d cells or -5-10 x 10 7 «Us, «ert solublllzed inTraKe- 
SSS?«« manuiactuie* i«*tm«0^ RNA ^Wrttoj, 
oJecitiitation, and solubilization were S*rformed ai described 
SrSEminftcbw. A total of 0.25 mg BNA «as aliquoted for 

oTpo^A) RNA from each «U lme Cultt^ ujtag . . 
SDin-coLmn fo«iat (Q1AGEN) per the manufacturer's in- 

ing tte.Hbddnm RKA Quanatation Reagent <M°!*«^« 
ftobes) in !>6-weU plate format, assayed ta ^tnpllcate. ^g the 
Fluorodsan Accent -EL combination 

(LabSystems). A total of SO ng m?NA was used to^ \^^} 
iisin/ the ThenttOSCtlpt RT-PCR System (GIBCO-BHW. 

» |we«tei)NA per the manufeftuTefS instruc- 

dons, v ■., *, - C^'-' 

Real-Time PCR ^ i 5 ^ 

Prior to assaying concentrations of mRNA species from 
sTtDDles by real-time quantitative PCR. several preparatory 
f^wete^qul^ FiSt, separate ^fi^^S 
erased for each gene to be analyzed. Genes to ^ a ™3™ 
were amplified fro man :^m^^.' n V^±'^^, 

from ffene s^tjenwavailab^ ™" 



Recei¥8di<415 852 0195 > at 11/13/026:1 M [Eastern Standard time] 



mm 



Thompson et ah 



W/}^:<^^& -&*2 orcteis of ■liw^twte generated atid as- 
" ; ; ' saved irvdu plicate with each 'iatttple: 1 ■ ... - . ■">■ 

Primers used in the real-time PGR reaction were designed 
from sequences Internal to those used to amplify the cDNA 
for the Standard curve/ with care taken to encompass all 
mRNA variants reported, in case multiple cDNAs w ^^*?^"^t 
ated byRT-PCR. In addition, real-time primers were desigjed 
to come from two different exon* to eliminate the pos^bility 
of cenomiC DNA and immature RNA contamination. Results 
- from real-time PCK are collected as A""?^^**^ 
number. By establishing a fluorescence threshold, a linear 
ffyaph Is generated of the standard curve using log (concen- 
tration) trto tied as a function of cycle threshold number. The 
equation of the line is then used to determine the starting 
concentration of mRNA for each unknown. For each assay, 
unknowns are assayed in triplicate with a new standard curve. 
Sample are normalized to total mRNA but a housekeeping 
gene, GAPD, was also quantitated. 
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